# Pattern 1: This huge regex just kills a generic list of stopwords 
(aback|aboard|above|abroad|according|accordingly|across|actually|affv|afield|afore|after|afterwards|again|against|ahead|ain.?t|albeit|alike|all|allow|allows|almost|alone|along|alongside|aloud|already|alright|also|although|always|amid|amidst|amiss|among|amongst|amoungst|amount|and|and\/or|anear|anenst|anent|another|any|anybody|anyhow|anyone|anything|anyway|anyways|anywhere|apache\/1|apart|appear|appreciate|appropriate|april|are|aren|aren.?t|around|array|ashore|aside|asking|asleep|associated|astraddle|atween|august|available|away|awful|awfully|back|backward|barefoot)$
^(barely|bating|became|because|become|becomes|becoming|been|before|beforehand|begin|beginning|behind|being|believe|below|beneath|beside|besides|best|better|between|betwixt|beyond|bill|billion|blog|blog.?archive|blog.?roll|blogs.?that.?link.?here|bodily|bold|both|bottom|brief|bright|but|c.?mon|call|came|can|can.?t|cannot|cant|caption|cause|causes|center|certain|certainly|changes|cheap|clearly|com|come|comes|computer|con|concerning|consequently|consider|considering|contain|containing|contains|continue|contra|copy|corresponding|could|couldn.?t|couldnt|course|create|create.?a.?link|cry|currently|damned|date|deadly)$
^(december|definitely|describe|described|despite|detail|didn|didn.?t|different|directly|discussion|displaymodefull|document.?write|does|doesn|doesn.?t|doing|don.?t|done|down|downward|downwards|dtd|due|duly|during|e\.?g\.?|each|early|edit.?post|eight|eighty|either|eleven|else|elsewhere|empty|ending|enough|entirely|equiv|ere|erg.?o|especially|etc|even|ever|every|everybody|everyone|everything|everywhere|exactly|example|except|excepting|external.?link|fairly|false|farther|february|feed|fer|few|ffffff|fifteen|fifth|fifty|fify|fill|find.?|first|firstly|five|float|followed|following|follows|font|for|forasmuch)$
^(forasmuchas|fore|foremost|forever|former|formerly|forth|forty|forwards|forwhy|.*found.*|four|free|fresh|from|front|full|fully|further|furthermore|gaily|general|get|gets|getting|ghastly|gingerly|give|given|gives|goes|going|gone|good|gotten|great|greetings|had|hadn.?t|happen|happens|hard|hardly|has|hasn|hasn.?t|hasnt|have|haven|haven.?t|having|he.?d|he.?ll|he.?s|hello|help|hence|her|here|here.?s|hereafter|hereby|herein|hereof|hereon|hereto|hereupon|herewith|hers|herself|hidden|high|him|himself|his|hither|hitherto|home|homepage|hopefully|hourly|how|how.?s|howbeit|howe.?er|however|howsoever|href|htmlview)$
^(http|hundred|i.?d|i.?ll|i.?m|i.?ve|i.?e.?|ignored|immediate|immediately|inasmuch|inasmuchas|inc|inc.?|indeed|indicate|indicated|indicates|indoors|information|inner|insofar|instead|interest|into|inward|inwardly|isn.?t|it.?d|it.?ll|it.?s|its|itself|january|join|jolly|july|june|just|keep|keeps|kept|keywords:|kindly|know|known|knows|laquo|large|largely|last|last.?100|last.?20|last.?50|late|lately|later|latter|latterly|least|left|less|lest|let.?s|like|liked|likely|likewise|link|little|lively|register|long|look|looking|looks|loose|loud|ltd|made|main|mainly|make|makes|many|march|margin|mark|may|maybe|mean)$
^(meantime|meanwhile|merely|meta|might|mighty|mill|million|mine|miss|more|more.?here|morehttp|moreover|most|mostly|move|moved.?here|moved.?object|movedthis|much|must|mustn.?t|myself|name|namely|near|nearby|nearly|necessary|need|needs|neither|never|nevertheless|new|newly|next|nine|ninety|nobody|none|nonetheless|noone|nor|normally|not|nothing|notwithstanding|novel|november|now|nowadays|nowhere|object|object.?moved|obviously|october|off|offshore|often|okay|once|one|one.?s|ones|only|onto|opuscule|original|original.?message|other|others|otherwise|ought|our|ours|ourselves|out|outright|outside|outwith|over|overall)$
^(overflow|own|padding|page|part|particular|particularly|partly|per|perhaps|permalink|permanent|placed|please|plus|poorly|posh|possible|post|posted|powerful|presumably|pretty|printer.?friendly|probably|proper|proud|provided|provides|providing|publicly|put|putting|quick|quite|quoties|raquo|rarely|rather|read.?more|readily|really|reasonably|recent|recently|regarding|regardless|regards|register|registerwidget|relatively|reputed|reserved|respectively|right|rightly|ring|said|same|save|saving|saying|says|scant|scarce|scarcely|second|secondly|see|seeing|seem|seemed|seeming|seems|seen|seldom|self|selves|send.?to.?a.?friend)$
^(sensible|sent|september|serious|seriously|seven|seventy|several|shall|shan.?t|she|she.?d|she.?ll|she.?s|shortly|should|shouldn|shouldn.?t|show|side|sidebar|sidebartop|sideways|silver.?site|simply|since|sincere|sith|six|sixty|size|slightly|small|sobeit|soft|solely|some|somebody|somehow|someone|something|sometime|sometimes|somewhat|somewhere|soon|sorry|span|specified|specify|specifying|stand|stark|stately|still|stop|stopwords|strange|strong|such|sudden|sure|surely|swift|syne|system|tags.?|take|taken|taking|tbody|tell|ten|tends|test|text|than|thank|thanks|thanx|that|that.?ll|that.?s|thats|the|their|theirs|them)$
^(themselves|then|thence|there|there.?ll|there.?s|thereafter|thereby|therefore|therein|thereof|theres|thereupon|therewith|these|they|they.?d|they.?ll|they.?re|they.?ve|thick|think|third|thirdly|thirty|this|thorough|thoroughly|those|though|thousand|three|thro|through|throughout|thru|thus|tight|till|timely|title|together|took|top|totally|toward|towards|trackback|tried|tries|trillion|truly|trying|twelve|twenty|twice|two|uncategorized|under|underneath|unduly|unfortunately|unless|unlike|unlikely|until|unto|upon|use|used|useful|user.?profile|uses|using|usually|value|various|version|versus|very|videobarviewRead|videobarviewabove)$
^(view.?user.?profile|view.?replies|viz.?|want|wanting|wants|was|wasn|wasn.?t|way|ways|we.?d|we.?ll|we.?re|we.?ve|weakly|webpage|website|welcome|well|went|were|weren|weren.?t|what|what.?ll|what.?s|whatever|when|when.?s|whenas|whence|whencesoever|whene.?er|whenever|whensoever|where|where.?er|where.?s|whereafter|whereas|whereby|wherein|wheresoever|wherethrough|whereunto|whereupon|wherever|whether|which|while|whiles|whilst|whither|whithersoever|who.?d|who.?ll|who.?s|whoever|whole|wholly|whom|whomever|whose|why.?s|widget|widgetinfo|widgetmanager|will|willing|with|within|without|won.?t|wonder|word.?wrap|worldly|worse)$
^(would|wouldn|wouldn.?t|xhtml|yeah|you|you.?d|you.?ll|you.?re|you.?ve|your|yours|yourself|yourselves|your.?title.?tag|zero|advanced|advanced-search|article-reprint|close|community|community-forum|create-a-group|forward|google|google-groups|google-home|highlight-this-message|homecommunity|join-this-group|leave-a-response|local|log-in|message-listing|next-thread|options|previous-thread|print-this-message|privacy|privacy-policy|quantcast|recent-archives|refresh|service|sign-in|sign-up|status|subscriptions|terms-conditions|thread-options|user-search|wordpress|default-aspx|show-original|navigation|.{25,})$

# Pattern 2: Kill any tags that end with this pattern
(!|-|-.|-.|-at|-with|-to|-by|-on|-without|-an|-a|-and|-as|-if|-like|-so|-when|-what|-where|-why|-who|-until|-unless|-for|-both|-but|-into|-on|-off|-from|-his|-her|-we|-us|-the|-what|-our|-your|-her|-his|-are|-have|-has|-that|-you|-powered|-will|-there|-here|-tag|-width|-such|-using|-with|-these|-this|-those|-need|-their|-she|-he|-other)\s*$

# Pattern 3: Kill any tags that start with this pattern
^(0px|about|add-|are-|also-|is-|-|.-|powered.?by|advertise.*|and|contact|site|submi|skip|reset|with|into|permalink|bookmark|forgot-|related-|posted-by-|view-|has-|have-|than-|there-|for-|than-|keyword|74|76|sign-in|marked-as|redirect|width|img-|div-|report-|share-|which-|what-|where-|when-|who-|why-|input|and-|but-|had-|did-|ever-|hello-|here-|http-|www-|index-|you-|while-|gave-|give-|although-|were-)

# Pattern 4: kill any tags that contain this pattern anywhere in the tag
[^\W]*(?:s(?:h(?:emale|it)|item|tats|ubscribe)|c(?:o(?:unter|mment)|lick|asino|ialis|unt)|r(?:(?:s|ingtone)s|eply)|er(?:ror|ection)|l(?:ogin|evitra)|blo(?:gs?|etura|wjob)|u003[dc]|p(?:r(?:iotupiliko|escription)|eni(?:s|le)|(?:harmac|uss)y)|a(?:d(?:ipex|min)|sdf)|4(?:yo)?u|vi(?:agra|oxx)|terms.?of|mail|holdem|xanax|zolus|(?:fu|doublecli)ck|(?:whor|nud)e)|39s|trackback[^\W]*

# More patterns
arial|index|span|share-this|categories|img-src|1px|no-repeat|added|recent-post|added|favorites|views|1px|this|000|that|any|quot|read-the-full|full-post|was|would|cannot|caused|could|those|them|this|than|when|where|who|just|they|can|by|favorite|about|filed-under|get-adobe|gmt|however|got|previous|next|-.*ly
